Offensive speech detection with irony mechanism

doi:10.1 1772/j.issn.1001-9081.2023040533

Abstract

Abstract: Offensive speech spread on the Internet seriously disrupts the normal network order and destroys the network environment for healthy communication. Existing detection technologies are more focused on the distinctive features in the text, and it is difficult to discover more subtle attack methods. In response to the above problems, an offensive speech detection model BSWD (BERT-based Sarcasm and Word Detection) incorporating irony mechanism is proposed. First, a model Sarcasm-BERT based on the irony mechanism is proposed to detect semantic conflicts in speech; secondly, a fine-grained vocabulary offensive feature extraction model WordsDetect is proposed to detect offensive words in speech; finally, the BSWD model is obtained by fusing the two models to improve the detection accuracy.The experimental results show that the accuracy, precision, recall, and F1 score indicators of the proposed model are generally improved by more than 2%, compared with the BERT(Bidirectional Encoder Representation from Transformers) and HateBERT methods, which significantly improves the detection performance and can better detect implicit offensive speech. Compared with the SKS (Sentiment Knowledge Sharing) and BiCHAT (Bidirectional Long Short-Term Memory with deep Convolution neural network and Hierarchical ATtention) methods, it has stronger generalization ability and robustness. The results of the ablation experiment show that the proposed model has an improvement of 0.5%~1.5%, compared with WordsDetect and Sarcasm-BERT in terms of precision, recall, and F1 score indicators, successfully integrating the advantages of each part of the work.

Key words: irony detection, offensive speech detection, fine-grained feature, obscure attack, attention mechanism

摘要： 互联网上散布的攻击性言论严重扰乱了正常网络秩序，破坏了健康交流的网络环境。现有的检测技术更专注于文本中的鲜明特征，难以发现更加隐晦的攻击方式。针对上述问题，提出了融合反讽机制的攻击性言论检测模型BSWD (BERT-based Sarcasm and Word Detection)。首先，提出基于反讽机制的模型Sarcasm-BERT，以检测言论中的语义冲突；其次，提出细粒度词汇攻击性特征提取模型WordsDetect，检测言论中的攻击性词汇；最后，融合两种模型得到BSWD模型，提高检测精度。实验结果表明，与BERT(Bidirectional Encoder Representation from Transformers)、HateBERT模型相比，所提模型的准确率、精确率、召回率和F1分数指标普遍提升2%以上，显著提高了检测性能，更能发现隐含的攻击性言论；同时，与SKS(Sentiment Knowledge Sharing)、BiCHAT(Bidirectional Long Short-Term Memory BiLSTM with deep Convolution neural networkCNN and Hierarchical ATtention)模型相比，具有更强的泛化能力和鲁棒性。消融实验结果表明，与WordsDetect、Sarcasm-BERT相比，所提模型的精确率、召回率和F1分数指标都有0.5%~1.5%的提升，成功集成了每部分工作的优势能力。

关键词: 反讽检测, 攻击性言论检测, 细粒度特征, 隐晦攻击, 注意力机制

CLC Number:

TP391.1

王海涵朱焱. 融合反讽机制的攻击性言论检测[J]. 《计算机应用》唯一官方网站, DOI: 10.1 1772/j.issn.1001-9081.2023040533.

[1]	Chun XU, Shuangyan JI, Huan MA, Enwei SUN, Mengmeng WANG, Mingyu SU. Consultation recommendation method based on knowledge graph and dialogue structure [J]. Journal of Computer Applications, 2025, 45(4): 1157-1168.
[2]	Shiyue GUO, Jianwu DANG, Yangping WANG, Jiu YONG. 3D hand pose estimation combining attention mechanism and multi-scale feature fusion [J]. Journal of Computer Applications, 2025, 45(4): 1293-1299.
[3]	Jie HU, Qiyang ZHENG, Jun SUN, Yan ZHANG. Multi-label classification model based on multi-label relational graph and local dynamic reconstruction learning [J]. Journal of Computer Applications, 2025, 45(4): 1104-1112.
[4]	Liqin WANG, Zhilei GENG, Yingshuang LI, Yongfeng DONG, Meng BIAN. Open-world knowledge reasoning model based on path and enhanced triplet text [J]. Journal of Computer Applications, 2025, 45(4): 1177-1183.
[5]	Liwei ZHANG, Quan LIANG, Yutao HU, Qiaole ZHU. Channel shuffle attention mechanism based on group convolution [J]. Journal of Computer Applications, 2025, 45(4): 1069-1076.
[6]	Kunyuan JIANG, Xiaoxia LI, Li WANG, Yaodan CAO, Xiaoqiang ZHANG, Nan DING, Yingyue ZHOU. Boundary-cross supervised semantic segmentation network with decoupled residual self-attention [J]. Journal of Computer Applications, 2025, 45(4): 1120-1129.
[7]	Haijun GENG, Yun DONG, Zhiguo HU, Haotian CHI, Jing YANG, Xia YIN. Encrypted traffic classification method based on Attention-1DCNN-CE [J]. Journal of Computer Applications, 2025, 45(3): 872-882.
[8]	Tianqi ZHANG, Shuang TAN, Xiwen SHEN, Juan TANG. Image watermarking method combining attention mechanism and multi-scale feature [J]. Journal of Computer Applications, 2025, 45(2): 616-623.
[9]	Dixin WANG, Jiahao WANG, Min LI, Hao CHEN, Guangyao HU, Yu GONG. Abnormal attack detection for underwater acoustic communication network [J]. Journal of Computer Applications, 2025, 45(2): 526-533.
[10]	Jiayang GUI, Shunji WANG, Zhengkang ZHOU, Jiashan TANG. Tunnel foreign object detection algorithm based on improved YOLOv8n [J]. Journal of Computer Applications, 2025, 45(2): 655-661.
[11]	Haiteng MENG, Xiaole ZHAO, Tianrui LI. Lightweight image super-resolution reconstruction based on asymmetric information distillation network [J]. Journal of Computer Applications, 2025, 45(2): 601-609.
[12]	Yan LI, Guanhua YE, Yawen LI, Meiyu LIANG. Enterprise ESG indicator prediction model based on richness coordination technology [J]. Journal of Computer Applications, 2025, 45(2): 670-676.
[13]	Qijian CAI, Wei TAN. Semantic graph enhanced multi-modal recommendation algorithm [J]. Journal of Computer Applications, 2025, 45(2): 421-427.
[14]	Lifang WANG, Jingshuang WU, Pengliang YIN, Lihua HU. Action recognition algorithm based on attention mechanism and energy function [J]. Journal of Computer Applications, 2025, 45(1): 234-239.
[15]	Pengcheng SONG, Lijun GUO, Rong ZHANG. Weakly supervised video anomaly detection with local-global temporal dependency [J]. Journal of Computer Applications, 2025, 45(1): 240-246.

Offensive speech detection with irony mechanism

融合反讽机制的攻击性言论检测

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics